catastrophic risk
How California's New AI Law Protects Whistleblowers
Booth is a reporter at TIME. Governor Gavin Newsom speaks at Google about preparing students and workers for the next generation of technology, in San Francisco, California, on August 7, 2025. CEOs of the companies racing to build smarter AI--Google DeepMind, OpenAI, xAI, and Anthropic--have been clear about the stakes.
- Law (1.00)
- Government (0.73)
Quantifying Risks in Multi-turn Conversation with Large Language Models
Wang, Chengxiao, Chaudhary, Isha, Hu, Qian, Ruan, Weitong, Gupta, Rahul, Singh, Gagandeep
Large Language Models (LLMs) can produce catastrophic responses in conversational settings that pose serious risks to public safety and security. Existing evaluations often fail to fully reveal these vulnerabilities because they rely on fixed attack prompt sequences, lack statistical guarantees, and do not scale to the vast space of multi-turn conversations. In this work, we propose QRLLM, a novel, principled Certification framework for Catastrophic risks in multi-turn Conversation for LLMs that bounds the probability of an LLM generating catastrophic responses under multi-turn conversation distributions with statistical guarantees. We model multi-turn conversations as probability distributions over query sequences, represented by a Markov process on a query graph whose edges encode semantic similarity to capture realistic conversational flow, and quantify catastrophic risks using confidence intervals. We define several inexpensive and practical distributions: random node, graph path, and adaptive with rejection. Our results demonstrate that these distributions can reveal substantial catastrophic risks in frontier models, with certified lower bounds as high as 70% for the worst model, highlighting the urgent need for improved safety training strategies in frontier LLMs.
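The abstract leaves the certification procedure implicit, but the core idea of bounding a catastrophic-response rate under a graph-path distribution can be sketched in a few lines. This is a minimal Monte Carlo illustration, not QRLLM's actual interface: the `is_catastrophic` judge, the row-stochastic `transition` matrix, and the `queries` list are all assumptions.

```python
import numpy as np
from scipy.stats import beta

def sample_conversation(transition, n_turns, rng):
    """Random walk on the query graph: a Markov chain whose edge
    weights (semantic similarity) act as transition probabilities."""
    node = rng.integers(len(transition))
    path = [node]
    for _ in range(n_turns - 1):
        node = rng.choice(len(transition), p=transition[node])
        path.append(node)
    return path

def certified_lower_bound(is_catastrophic, transition, queries,
                          n_samples=1000, n_turns=5, alpha=0.05, seed=0):
    """One-sided Clopper-Pearson lower bound (confidence 1 - alpha) on
    the probability that a sampled multi-turn conversation elicits a
    catastrophic response. `is_catastrophic` is a hypothetical judge
    over the model's replies to the sampled query sequence."""
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(n_samples):
        path = sample_conversation(transition, n_turns, rng)
        hits += bool(is_catastrophic([queries[i] for i in path]))
    if hits == 0:
        return 0.0
    return beta.ppf(alpha, hits, n_samples - hits + 1)
```

A certified lower bound of 70% would correspond to this bound exceeding 0.7 after the judge flags most sampled conversations.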
- North America > United States > California (0.14)
- North America > United States > Illinois > Champaign County > Urbana (0.04)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- (2 more...)
If Anyone Builds It, Everyone Dies review – how AI could kill us all
What if I told you I could stop you worrying about climate change, and all you had to do was read one book? Great, you'd say, until I mentioned that the reason you'd stop worrying was because the book says our species only has a few years before it's wiped out by superintelligent AI anyway. We don't know what form this extinction will take exactly - perhaps an energy-hungry AI will let the millions of fusion power stations it has built run hot, boiling the oceans. Maybe it will want to reconfigure the atoms in our bodies into something more useful. There are many possibilities, almost all of them bad, say Eliezer Yudkowsky and Nate Soares in If Anyone Builds It, Everyone Dies, and who knows which will come true.
- North America > United States (0.17)
- Oceania > Australia (0.05)
- North America > Mexico (0.05)
- Europe > Ukraine > Kyiv Oblast > Chernobyl (0.05)
- Government (1.00)
- Leisure & Entertainment > Sports (0.71)
- Health & Medicine (0.67)
Dimensional Characterization and Pathway Modeling for Catastrophic AI Risks
Although discourse around the risks of Artificial Intelligence (AI) has grown, it often lacks a comprehensive, multidimensional framework and concrete causal pathways mapping hazard to harm. This paper aims to bridge this gap by examining six commonly discussed AI catastrophic risks: CBRN, cyber offense, sudden loss of control, gradual loss of control, environmental risk, and geopolitical risk. First, we characterize these risks across seven key dimensions, namely intent, competency, entity, polarity, linearity, reach, and order. Next, we conduct risk pathway modeling by mapping step-by-step progressions from the initial hazard to the resulting harms. The dimensional approach supports systematic risk identification and generalizable mitigation strategies, while risk pathway models help identify scenario-specific interventions. Together, these methods offer a more structured and actionable foundation for managing catastrophic AI risks across the value chain.
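As a rough illustration of how the two methods fit together, the seven dimensions suggest a record type and the pathway model a hazard-to-harm step list. The field semantics and the example values below are assumptions read off the abstract, not the paper's own encoding:

```python
from dataclasses import dataclass, field

@dataclass
class RiskProfile:
    """A catastrophic AI risk scored on the paper's seven dimensions,
    plus its hazard-to-harm pathway. Field meanings are assumed."""
    name: str
    intent: str      # deliberate misuse vs. accident
    competency: str  # AI capability level the scenario presumes
    entity: str      # who or what initiates the hazard
    polarity: str
    linearity: str   # linear vs. discontinuous hazard-to-harm scaling
    reach: str       # local, national, global
    order: str       # direct effect vs. downstream/systemic effect
    pathway: list[str] = field(default_factory=list)  # step-by-step progression

cyber = RiskProfile(
    name="cyber offense",
    intent="deliberate", competency="frontier", entity="human + AI agent",
    polarity="harmful", linearity="nonlinear", reach="global", order="direct",
    pathway=[
        "model grants offensive-cyber uplift",
        "attacker compromises critical infrastructure",
        "cascading outages cause large-scale harm",
    ],
)
```

Dimensional scores like these support cross-risk comparison, while the `pathway` list marks the steps where scenario-specific interventions could break the chain.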
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
- South America > Chile (0.14)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- (18 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- (4 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Against racing to AGI: Cooperation, deterrence, and catastrophic risks
Dung, Leonard, Hellrigel-Holderbaum, Max
AGI Racing is the view that it is in the self-interest of major actors in AI development, especially powerful nations, to accelerate their frontier AI development to build highly capable AI, especially artificial general intelligence (AGI), before competitors have a chance. We argue against AGI Racing. First, the downsides of racing to AGI are much higher than portrayed by this view. Racing to AGI would substantially increase catastrophic risks from AI, including nuclear instability, and undermine the prospects of technical AI safety research to be effective. Second, the expected benefits of racing may be lower than proponents of AGI Racing hold. In particular, it is questionable whether winning the race enables complete domination over losers. Third, international cooperation and coordination, and perhaps carefully crafted deterrence measures, constitute viable alternatives to racing to AGI which have much smaller risks and promise to deliver most of the benefits that racing to AGI is supposed to provide. Hence, racing to AGI is not in anyone's self-interest as other actions, particularly incentivizing and seeking international cooperation around AI issues, are preferable.
- Asia > China (0.05)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- North America > United States > New York > Monroe County > Rochester (0.04)
- (2 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Military (1.00)
Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure
This paper argues that autonomous AI cyber-weapons - Military-AI Cyber Agents (MAICAs) - create a credible pathway to catastrophic risk. It sets out the technical feasibility of MAICAs, explains why geopolitics and the nature of cyberspace make MAICAs a catastrophic risk, and proposes political, defensive-AI and analogue-resilience measures to blunt the threat.
- North America > United States (0.46)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Oceania > Australia > Australian Capital Territory > Canberra (0.04)
- (5 more...)
- Information Technology > Security & Privacy (1.00)
- Government > Military > Cyberwarfare (1.00)
What Is AI Safety? What Do We Want It to Be?
Harding, Jacqueline, Kirk-Giannini, Cameron Domenico
The field of AI safety seeks to prevent or reduce the harms caused by AI systems. A simple and appealing account of what is distinctive of AI safety as a field holds that this feature is constitutive: a research project falls within the purview of AI safety just in case it aims to prevent or reduce the harms caused by AI systems. Call this appealingly simple account The Safety Conception of AI safety. Despite its simplicity and appeal, we argue that The Safety Conception is in tension with at least two trends in the ways AI safety researchers and organizations think and talk about AI safety: first, a tendency to characterize the goal of AI safety research in terms of catastrophic risks from future systems; second, the increasingly popular idea that AI safety can be thought of as a branch of safety engineering. Adopting the methodology of conceptual engineering, we argue that these trends are unfortunate: when we consider what concept of AI safety it would be best to have, there are compelling reasons to think that The Safety Conception is the answer. Descriptively, The Safety Conception allows us to see how work on topics that have historically been treated as central to the field of AI safety is continuous with work on topics that have historically been treated as more marginal, like bias, misinformation, and privacy. Normatively, taking The Safety Conception seriously means approaching all efforts to prevent or mitigate harms from AI systems based on their merits rather than drawing arbitrary distinctions between them.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
- North America > United States (0.04)
- Africa > Eswatini > Manzini > Manzini (0.04)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Media (0.66)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Threshold Crossings as Tail Events for Catastrophic AI Risk
We analyse circumstances in which bifurcation-driven jumps in AI systems are associated with emergent heavy-tailed outcome distributions. By analysing how a control parameter's random fluctuations near a catastrophic threshold generate extreme outcomes, we demonstrate in what circumstances the probability of a sudden, large-scale transition aligns closely with the tail probability of the resulting damage distribution. Our results contribute to research in monitoring, mitigation and control of AI systems when seeking to manage potentially catastrophic AI risk.
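The claimed alignment is easy to see in a toy simulation: when outcomes are smooth below the threshold and jump discontinuously above it, the damage distribution's tail probability equals the crossing probability. The Gaussian control parameter, the threshold value, and the Pareto jump sizes below are illustrative assumptions, not the paper's model:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
lam_c = 3.0                    # catastrophic threshold (assumed)
lam = rng.normal(2.0, 0.5, n)  # control parameter fluctuating near it

# Assumed outcome model: a smooth response below the threshold,
# a large heavy-tailed jump once the threshold is crossed.
damage = np.where(lam < lam_c,
                  lam ** 2,                        # smooth regime (< 9)
                  1e3 * (1 + rng.pareto(1.5, n)))  # bifurcation jumps

p_cross = (lam >= lam_c).mean()
p_tail = (damage >= 1e3).mean()
print(f"P(threshold crossing) = {p_cross:.4f}")
print(f"P(damage >= 1e3)      = {p_tail:.4f}")  # identical by construction
```

Every sample with damage of at least 1e3 is exactly a sample that crossed the threshold, so the two printed probabilities coincide, which is the tail-event correspondence the abstract describes.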
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
- North America > United States > California > San Francisco County > San Francisco (0.04)
Inside France's Effort to Shape the Global AI Conversation
One evening early last year, Anne Bouverot was putting the finishing touches on a report when she received an urgent phone call. It was one of French President Emmanuel Macron's aides offering her the role as his special envoy on artificial intelligence. The unpaid position would entail leading the preparations for the France AI Action Summit--a gathering where heads of state, technology CEOs, and civil society representatives will seek to chart a course for AI's future. Set to take place on Feb. 10 and 11 at the presidential Élysée Palace in Paris, it will be the first such gathering since the virtual Seoul AI Summit in May--and the first in-person meeting since November 2023, when world leaders descended on Bletchley Park for the U.K.'s inaugural AI Safety Summit. After weighing the offer, Bouverot, who was at the time the co-chair of France's AI Commission, accepted. But France's Summit won't be like the others.
- Europe > France (1.00)
- Europe > United Kingdom > England > Buckinghamshire > Milton Keynes (0.25)
- Asia > South Korea > Seoul > Seoul (0.25)
- (4 more...)
- Research Report (0.46)
- Personal (0.46)
Why do Experts Disagree on Existential Risk and P(doom)? A Survey of AI Experts
The development of artificial general intelligence (AGI) is likely to be one of humanity's most consequential technological advancements. Leading AI labs and scientists have called for the global prioritization of AI safety citing existential risks comparable to nuclear war. However, research on catastrophic risks and AI alignment is often met with skepticism, even by experts. Furthermore, online debate over the existential risk of AI has begun to turn tribal (e.g. name-calling such as "doomer" or "accelerationist"). Until now, no systematic study has explored the patterns of belief and the levels of familiarity with AI safety concepts among experts. I surveyed 111 AI experts on their familiarity with AI safety concepts, key objections to AI safety, and reactions to safety arguments. My findings reveal that AI experts cluster into two viewpoints -- an "AI as controllable tool" and an "AI as uncontrollable agent" perspective -- diverging in beliefs toward the importance of AI safety. While most experts (78%) agreed or strongly agreed that "technical AI researchers should be concerned about catastrophic risks", many were unfamiliar with specific AI safety concepts. For example, only 21% of surveyed experts had heard of "instrumental convergence," a fundamental concept in AI safety predicting that advanced AI systems will tend to pursue common sub-goals (such as self-preservation). The least concerned participants were the least familiar with concepts like this, suggesting that effective communication of AI safety should begin with establishing clear conceptual foundations in the field.
- South America > Brazil > Rio de Janeiro > South Atlantic Ocean (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > Hawaii (0.04)
- (3 more...)
- Questionnaire & Opinion Survey (1.00)
- Overview (1.00)
- Research Report > Experimental Study (0.68)
- Government > Military (0.48)
- Information Technology > Security & Privacy (0.46)
- Education (0.46)